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ABSTRACT 

A magnitude limited sample of nearly 9000 early-type galaxies, in the redshift range 0.01 < 
z < 0.3, was selected from the Sloan Digital Sky Survey using morphological and spectral criteria. 
The sample was used to study how early-type galaxy observables, including luminosity L, effective 
radius R Q , surface brightness I 0l color, and velocity dispersion a, are correlated with one another. 
Measurement biases are understood with mock catalogs which reproduce all of the observed 
scaling relations and their dependences on fitting technique. At any given redshift, the intrinsic 
distribution of luminosities, sizes and velocity dispersions in our sample are all approximately 
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Gaussian. A maximum likelihood analysis shows that a oc £°- 2 5±o.oi2^ ^ K ^o.63±o.o25^ an j 
R oc /-°- 75 ± 02 i n the r* band. In addition, the mass-to-light ratio within the effective radius 
scales as M D /L oc 7 y 014 ± 02 r M a /L oc M" 22±0 05 , and galaxies with larger effective masses have 
smaller effective densities: A Q oc M~°- 52±0 03 . These relations are approximately the same in the 
g* , i* and z* bands. Relative to the population at the median redshift in the sample, galaxies at 
lower and higher redshifts have evolved only little, with more evolution in the bluer bands. The 
luminosity function is consistent with weak passive luminosity evolution and a formation time of 
about 9 Gyrs ago. 

Subject headings: galaxies: elliptical — galaxies: evolution — galaxies: fundamental parameters 
— galaxies: photometry — galaxies: stellar content 



1. Introduction 

This is the second of four papers in which the properties of ~ 9000 early-type galaxies, in the redshift 
range 0.01 < z < 0.3 are studied. Paper I (Bcrnardi et al. 2003a) describes how the sample was selected 
from the SDSS database. The sample is essentially magnitude limited, and the galaxies in it span a wide 
range of environments. Each galaxy in the sample has measured values of luminosity L, effective radius R 
and surface brightness I = (L/2)/Rl in four bands (g* , r* , i* and z*), a velocity dispersion a, a redshift, 
and an estimate of the local density. 

Section 2 of the present paper shows that the luminosity function of the galaxies in our sample, when 
expressed as a function of absolute magnitude, is well described by a Gaussian form, and that the lumi- 
nosities in the population as a whole appear to be evolving passively. Section 3 studies the distribution of 
(the logarithm of) velocity dispersion, size, surface-brightness, effective mass and effective density at fixed 
luminosity; all of these are quite well described by Gaussian forms, suggesting that the intrinsic distributions 
of log(size) and log(velocity dispersion) are, like the distribution of log(luminosity), approximately Gaussian. 
Maximum-likelihood estimates of these and other correlations, which include the Faber- Jackson relation, the 
mass-to-light ratio, the Kormendy relation and a mass-density relation are presented in Section 4. Ap- 
pendix A describes a method for generating accurate mock complete and magnitude-limited galaxy catalogs, 
which are useful for assessing the relative importance of evolution and selection effects. The procedure used 
to estimate errors on our results is discussed in Appendix B. 

Paper III (Bernardi et al. 2003b) of this series places special emphasis on the Fundamental Plane relation 
between size, surface brightness and velocity dispersion. It shows how the FP depends on waveband, color, 
redshift and environment. Paper IV (Bernardi et al. 2003c) uses the colors and spectra of these galaxies to 
provide information about the chemical evolution of the early-type population. 

Except where stated otherwise, we write the Hubble constant as H = 100 h kms" 1 Mpc -1 , and we 
perform our analysis in a cosmological world model with (^m,^a,^) = (0.3,0.7,0.7), where f^M and f^A 
are the present-day scaled densities of matter and cosmological constant. In such a model, the age of 
the Universe at the present time is t n = 9.43/i _1 Gyr. For comparison, an Einstein-de Sitter model has 
(f^M, ^a) = (1, 0) and t = 6.52/i _1 Gyr. We frequently use the notation h^o as a reminder that we have set 
h = 0.7. Also, we will frequently be interested in the logarithms of physical quantities. Our convention is to 
set R = log 10 R and V = log 10 a, where R and a are effective radii in h^Q kpc and velocity dispersions in 
km s _1 , respectively. 
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2. The luminosity function 

Our sample is magnitude limited (Table 1 of Paper I gives the magnitude limits in the different bands). 
Therefore, we measure the luminosity function of the galaxies in our sample using two techniques. The first 
uses volume limited catalogs, and the second uses a maximum likelihood procedure (Sandage, Tammann & 
Yahil 1979; Efstathiou, Ellis & Peterson 1988). 

In the first method, we divide our parent catalog into many volume limited subsamples; this was possible 
because the parent catalog is so large. When doing this, we must decide what size volumes to choose. We 
would like our volumes to be as large as possible so that each volume represents a fair sample of the 
Universe. On the other hand, the volumes must not be so large that evolution effects are important. In 
addition, because our catalog is cut at the bright as well as the faint end, large- volume subsamples span only 
a small range in luminosities. Therefore, we are forced to compromise: we have chosen to make the volumes 
about Az = 0.04 thick, because cAz/H w 120/i _1 Mpc is larger than the largest structures seen in numerical 
simulations of the cold dark matter family of models (e.g., Colberg et al. 2000). The catalogs are extracted 
from regions which cover a very wide angle on the sky, so the actual volume of any given volume limited 
catalog is considerably larger than (120/i _1 Mpc) 3 . Therefore, this choice should provide volumes which are 
large enough in at least two of the three coordinate directions that they represent fair samples, but not so 
large in the redshift direction that the range in luminosities in any given catalog is small, or that evolution 
effects are washed out. 

The volume-limited subamples are constructed as follows. First, we specify the boundaries in redshift 
of the catalog: z m ; n and z max = z m ; n + 0.04. In the context of a world model, these redshift limits, when 
combined with the angular size of the catalog, can be used to compute a volume. This volume depends 
on z m j n ,z max and the world model: as our fiducial model we set CIm = 0.3 and Q\ = 1 — Qm- (Our 
results hardly change if we use an Einstein de-Sitter model instead.) We then compute the K-corrected 
limiting luminosities I ma x(-Zmin) and £ m ; n (z max ) given the apparent magnitude limits, the redshift limits, 
and the assumed cosmology. A galaxy i is included in the volume limited subsample if z m ; n < z% < z max and 
imin < Li < £ m ai- The luminosity function for the volume limited subsample is obtained by counting the 
number of galaxies in a luminosity bin and dividing by the volume of the subsample. 

The top panels in Figure 1 show the result of doing this in the g* and r* bands. Stars, circles, diamonds, 
triangles, squares and crosses show measurements in volume limited catalogs which have z m i n — 0.04, 0.08, 
0.12, 0.16, 0.20, and 0.24 and z max = z m ; n + 0.04. Each subsample contains more than five hundred galaxies, 
except for the two most distant, which each contain about one hundred. As one would expect, the nearby 
volumes provide the faint end of <j)(M), and the more distant volumes show the bright end. The extent 
to which the different volume limited catalogs all trace out the same curve is a measure of how little the 
luminosity function at low and high rcdshifts differs from that at the median redshift. 

The bottom panels in Figure 1 show evidence that, in fact, the galaxies in our data show evidence for 
a small amount of evolution: at fixed comoving density, the higher redshift population is slightly brighter 
than that at lower redshifts. Although volume- limited catalogs provide model- independent measures of this 
evolution, the test is most sensitive when a large range of luminosities can be probed at two different redshifts. 
Because the SDSS catalogs are cut at both the faint and the bright ends, our test for evolution is severely 
limited. Nevertheless, the small trends we see are both statistically significant, and qualitatively consistent 
with what one expects of a passively evolving population. (Note that our sample contains only early-type 
galaxies. Blanton et al. 2001 study the luminosity function in an SDSS sample which contains all galaxy 
types, but they ignore evolution effects. Since late-type galaxies are expected to evolve more rapidly than 
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Fig. 1. — Luminosity functions in the g* and r* bands. Stars, circles, diamonds, triangles, squares and 
crosses show measurements in volume limited catalogs which are adjacent in redshift of width Az = 0.04, 
starting from a minimum of z m ; n = 0.04. Top panels show that the higher redshift catalogs contribute at 
the bright end only. At the same comoving density, the symbols which represent the higher redshift catalogs 
tend to be displaced slightly to the left of the those which represent the lower redshift catalogs. Bottom 
panels show this small mean shift towards increasing luminosity with increasing redshift. 
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early-types, it is important to redo Blanton et al.'s analysis after allowing for evolution.) 

Before we make more quantitative conclusions, notice that a bell-like Gaussian shape would provide a 
reasonable description of the luminosity function. Although early-type galaxies are expected to have red 
colors, our sample was not selected using any color information. It is reassuring, therefore, that the Gaussian 
shape we find here also provides a good fit to the luminosity function of the redder objects in the SDSS 
parent catalog (see the curves for the two reddest galaxy bins in Fig. 14 of Blanton et al. 2001). A Gaussian 
form also provides a reasonable description of the luminosity function of early-type galaxies in the CNOC2 
survey (Lin et al. 1999, even though they actually fit a Schechter function to their measurements). The 
2dFGRS galaxies classified as being of Type 1 by Madgwick et al. (2002) should be similar to early-types. 
Their Type l's extend to considerably fainter absolute magnitudes than our sample and the shape of the 
luminosity function they report is quite different from ours. This is probably because the population of 
early- type galaxies at faint absolute magnitudes is quite different from the brighter ones (e.g., Sandage & 
Perelmuter 1990). In any case, their Schechter function fits underestimate the number density of luminous 
Type 1 galaxies — a Gaussian tail would provide a significantly better fit. 

Given that the Gaussian form provides a good description of our data, we use the maximum- likelihood 
method outlined by Sandage, Tammann & Yahil (1979) to estimate the parameters of the best-fitting lumi- 
nosity function. For magnitude limited samples which are small and shallow, this is the method of choice. 
For a sample such as ours, which spans a sufficiently wide range in redshifts that evolution effects might be 
important, the method requires a model for the evolution. We parametrize the luminosity evolution similarly 
to Lin et al. (1999). That is to say, if we were solving only for the luminosity function, then the likelihood 
function we maximize would be 



M m i n (zi) and M max (zi) denote the minimum and maximum absolute magnitudes at Zi which satisfy the 
apparent magnitude limits of the survey, and i runs over all the galaxies in the catalog. (At small z, this 
parametrization of the evolution in absolute magnitude implies that the luminosity evolves as L»(z)/L*(0) w 
(1 + z) q , with q = Q ln(10)/2.5. Note that, in assuming that only M* evolves, this model assumes that there 
is no differential evolution in luminosities, i.e., that luminous and not so luminous galaxies evolve similarly. 

Figure 2 shows the result of estimating the luminosity function in this way in the g*, r* , i* and z* 
bands. Later in this paper, we will solve simultaneously for the joint distribution of luminosity, size and 
velocity dispersion; it is the parameters which describe the luminosity function of this joint solution which 
are shown in Fig. 2. The dashed lines in each panel show the Gaussian shape of the luminosity function at 
redshift z = 0. For comparison, the symbols show the measurements in the same volume limited catalogs 
as before, except that now we have subtracted the maximum likelihood estimate of the luminosity evolution 
from the absolute magnitudes M before plotting them. If the model for the evolution is accurate, then the 
different symbols should all trace out the same smooth dashed curve. 

The comoving number density of the galaxies in this sample is 0* = 5.8 ±0.3 x 10~ 3 /i 3 Mpc~ 3 in all four 
bands. Because the different bands have different apparent magnitude limits, and they were fit independently 
of each other, it is reassuring that the same value of works for all the bands. For similar reasons, it is 
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Fig. 2. — Luminosity functions in the g*, r*, i* and z* bands, corrected for pure luminosity evolution. 
Symbols with error bars show the estimates from our various volume limited catalogs; the higher redshift 
catalogs contribute at the bright end only. Dashed curves show the shape of the Gaussian shaped luminosity 
function which maximizes the likelihood of seeing this data at redshift z = 0. 
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Fig. 3. — The number of galaxies as a function of redshift in our sample. Solid curves show the predicted 
counts if the comoving number densities are constant, but the luminosities brighten systematically with 
redshift: M*(z) = M*(0) — Qz with Q given by the previous figure. Dashed curves show what one predicts 
if there is no evolution whatsoever, and the luminosity function is fixed to the value it has at the median 
redshift of our sample (z = 0.1). 
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reassuring that the best-fit values of M* imply rest-frame colors at z — of g* — r* = 0.72, r* — i* = 0.34, 
and r* — z* = 0.68, which are close to those of the models which we used to compute our K-corrections 
(Appendix A of Paper I), even though no a priori constraint was imposed on what these rest- frame colors 
should be. 

The histograms in each of the four panels of Figure 3 show the number of galaxies observed as a function 
of redshift in the four bands. The peak in the number counts at z ~ 0.08 is also present in the full SDSS 
sample, which includes late-types, and, perhaps more surprisingly, an overdensity at this same redshift is 
also present in the 2dF Galaxy Redshift Survey. (The second bump at z ~ 0.13 is also present in the 
2dFGRS counts.) The solid curves show what we expect to see for the evolving Gaussian function fits — the 
curves provide a reasonably good fit to the observed counts, although they slightly overestimate the numbers 
at high redshift in the redder wavebands. For comparison, the dashed curves show what is expected if the 
luminosities do not evolve and the no-evolution luminosity function is given by the one at the median redshift 
(i.e., a Gaussian with mean M* — 0.1Q). Although the fit to the high-redshift tail is slightly better, this no 
evolution model cannot explain the trends shown in the bottom panel of Figure 1. Moreover, a Bruzual & 
Chariot (2003) passive evolution model with a formation time of 9 Gyrs ago, predicts that the rest-frame 
luminosities at redshift z = 0.2 should be brighter than those at z = by 0.3, 0.26, 0.24, and 0.21 mags in 
g* , r* , i* and z* respectively — not far off from what we estimate. 

The bottom panels in Figure 1 suggest two possible reasons why our model of pure luminosity evolution 
overestimates dN/dz at higher z. One possibility is that the comoving number densities are decreasing slightly 
with redshift. A small amount of density evolution is not unexpected, because early-type galaxy morphologies 
may evolve (van Dokkum & Franx 2001), and our sample is selected on the basis of a fixed morphology. If 
we allow a small amount of density as well as luminosity evolution, and we use <f>*(z) — lO a4Pz 0*(O) with 
P w —2, as suggested by the results of Lin et al. (1999), then the resulting dN/dz curves are also well fit by 
the dashed curves. A second possibility follows from the fact that we only observe the most luminous part 
of the higher redshift population. If the most luminous galaxies at any given time are also the oldest, then 
one might expect the bright end of the luminosity function to evolve less rapidly than the fainter end. The 
curvature seen in the bottom panel of Figure 1 suggests that although the evolution of the fainter objects 
in our sample (which we only see out to low redshifts) is consistent with formation times of 9 Gyrs ago, the 
brighter objects are not. Models of differential evolution in the luminosities also predict dN/dz distributions 
which are in better agreement with the observed counts at high redshift. Since the evolution of the luminosity 
function is small, we prefer to wait until we are able to make more accurate K-corrections before accounting 
for either of these other possibilities more carefully Therefore, in what follows, we will continue to use the 
model with pure luminosity evolution. 

Repeating the exercise described above but for an Einstein-de-Sitter model yields qualitatively similar 
results, although the actual values of M* and (p* are slightly different. At face value, the fact that we see 
so little evolution in the luminosities argues for a relatively high formation redshift: the Bruzual & Chariot 
(2003) models indicate that tf olm ~ 9 Gyrs. 

3. Observed correlations: Distributions at fixed luminosity 

This Section presents scatter plots between different observables X and luminosity. This is done because, 
except for a cut at small velocity dispersions, our sample was selected by luminosity alone. This means that 
the distributions of X at fixed luminosity are not biased by the selection cut (e.g., Schechter 1980). The 
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distribution of X at fixed L is shown to be reasonably well described by a Gaussian for all the choices of 
X we consider. This simplifies the maximum likelihood analysis described in Section 4 which we use to 
estimate a number of observed correlations (it is also used in Paper III to estimate the parameters of the 
Fundamental Plane). 

The best way to think of any absolute magnitude M versus X scatter plot is to imagine that, at fixed 
absolute magnitude M, there is a distribution of X values. The scatter plot then shows the joint distribution 

<P(M,X\z)dMdX = dM(f>(M\z) p(X\M,z)dX, (2) 

where <f>{M, X\z) denotes the density of galaxies with X and M at z, and 4>(M\z) is the luminosity function 
at z which we computed in Section 2. One of the results of this section is to show that the shape of p(X\M, z) 
is simple for most of the relations of interest. 

The mean value of X at fixed M is independent of the fact that our catalogs are magnitude limited. 
Therefore, we estimate the parameters of linear relations of the form: 

(X-X.) = -°- 4 <7 M -> . (3) 

where M = — 2.51og 10 L is the absolute magnitude and X is the observable (for example, we will study 
X = log 10 a, log 10 R or [i Q = — 2.5 log 10 I ). For each volume limited catalog, we fit for the slope S and 
zero-point of the linear relation. If there really were a linear relation between M and X, and neither X nor 
M evolved, then the slopes and zero-points of the different volume limited catalogs would be the same. 

To illustrate, the different symbols in Figure 4 show (\og 1Q a\M) , the Faber-Jackson relation (Faber & 
Jackson 1976), in our dataset. Most datasets in the literature are consistent with the scaling (cr\L) oc L 1 / 4 , 
approximately independent of waveband. For example, Forbes & Ponman (1999), using a compilation of 
data from Prugniel & Simicn (1996) report L oc cr 3 92 in the B-band. At longer wavelengths Pahre et al. 
(1998) report L K cx ^^±0.22 in thc K -band, with a scatter of 0.93 mag. 

Stars, circles, diamonds, triangles, squares and crosses show the relation measured in volume limited 
catalogs of successively higher redshift (redshift limits are the same as in Figure 1). The galaxies in each 
subsample were further divided into two equal-sized parts based on luminosity. The symbols with error bars 
show the mean log 10 a for each of these small bins in M, and the rms spread around it (note that the error 
on the mean is smaller than the size of the symbols in all but the highest redshift catalogs). The solid line 
shows the maximum-likelihood estimate of the slope of this relation at z = 0, which we describe in Section 4. 
Comparison with this line shows that the higher redshift population is slightly brighter. The slope of this line 
is shown in the top of each panel: a cx L 1 / 4 , approximately, in all the bands, consistent with the literature. 
The zero point, however, is different; at fixed luminosity, the objects in our sample have velocity dispersions 
which are smaller than those reported in the literature by about log w a — 0.05. 

We have enough data that we can actually do more than simply measure the mean X at fixed M; 
we can also compute the distribution around the mean. If we do this for each catalog, then we obtain 
distributions which are approximately Gaussian in shape, with dispersions which depend on the range of 
luminosities which are in the subsample. Rather than showing these, we created a composite catalog by 
stacking together the galaxies from the nonoverlapping volume limited catalogs, and we then divided the 
composite catalog into five equal sized bins in luminosity. The histograms in the bottom of the plot show 
the shapes of the distribution of velocities in the different luminosity bins. Except for the lowest and highest 
redshift catalogs for which the statistics are poorest, the different distributions have almost the same shape; 
only the mean changes. 



-10- 



o 
1> 



O 




2 2.2 2.4 2.6 2 2.2 2.4 2.6 
Log 10 cj [kms" 1 ] 



Fig. 4. — Relation between luminosity L and velocity dispersion a. Stars, circles, diamonds, triangles, 
squares and crosses show the error-weighted mean value of log 10 <J for a small range in luminosity in each 
volume limited catalog (see text for details). (Only catalogs containing more than one hundred galaxies are 
shown.) Error bars show the rms scatter around this mean value. Solid line shows the maximum-likelihood 
estimate of this relation, and the label in the top left shows the scaling it implies. Histograms show the 
distribution of log 10 a in small bins in luminosity. They were obtained by stacking together non-overlapping 
volume limited catalogs to construct a composite catalog, and then dividing the composite catalog into five 
equal size bins in luminosity. 



- 11 - 




Fig. 5. — Same as previous figure, but for the relation between luminosity L and effective radius R, 
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One might have worried that the similarity of the distributions is a signature that they are dominated by 
measurement error. This is not the case: the typical measurement error is about a factor of two smaller than 
the rms of any of these distributions. If we assume that the measurement errors are Gaussian-distributed, 
then the distributions we see should be the true distribution broadened by the Gaussian from the measure- 
ment errors. The fact that the observed distributions are well approximated by Gaussians suggests that 
the true intrinsic distributions are also Gaussian. The fact that the width of the intrinsic distribution is 
approximately independent of M considerably simplifies the maximum likelihood analysis presented in the 
next section. 

It is well known that color is strongly correlated with velocity dispersion (Paper IV of this series shows 
the color— a relation in our sample). One consequence of this is that residuals from the a — L relation shown 
in Figure 4 correlate strongly with color: at fixed magnitude, the redder galaxies have the highest velocity 
dispersions. In addition, as a whole, the reddest galaxies populate the high a part of the relation. Forbes 
& Ponman (1999) reported that residuals from the Faber-Jackson relation correlate with age. If color is 
an indicator of age and/or metallicity, then our finding is qualitatively consistent with theirs: the typical 
age/metallicity varies along the Faber-Jackson relation. 

A similar study of the relation between the luminosities and sizes of galaxies is shown in Figure 5. 
Schade et al. (1997) find oc R^ 3 in the B band, whereas, at longer wavelengths, Pahre et al. (1998) 
find Lk oc Rl' 4 with an rms of 0.88 mag. This suggests that the relation depends on wavelength. We 
find (R \L) oc L 2 / 3 in g* , but (R \L) oc L 3 ^ 5 in the other bands. The distribution p(log 10 R \M) is also 
reasonably well fit by a Gaussian, with a mean which increases with luminosity, and a dispersion which is 
approximately independent of M. The rms around the mean is about one and a half times larger than the 
rms around the mean a — L relation. We argue in Paper IV that the color-magnitude and color-size relations 
are a consequence of the color— a correlation. If this is correct, then residuals from the R Q — L relation, 
should not correlate with size or magnitude. We have checked that this is correct, although we have not 
included a plot showing this explicitly. 

There is an interesting correlation between the residuals of the Faber-Jackson and R — L relations. At 
fixed luminosity, galaxies which are larger than the mean (R \L) tend to have smaller velocity dispersions. 
This is shown in Figure 6, which plots the residuals from the a — L relation versus the residuals from the 
R — L relation. The short dashed lines show the forward and inverse fits to this scatter plot. The long- 
dashed line in between the other two shows Ar\m/&r\m = —A v \m/^v\m, where A x \m denotes the residual 
from the mean relation at fixed M, and Gx\m denotes the rms of this residual. The anti-correlation is 
approximately the same for all L. 

This suggests that a plot of L versus some combination of R Q and a should have considerably less scatter 
than either of the two individual relations. To illustrate, Figure 7 shows the distribution of the combination 
R cr 2 at fixed L. The scatter in L is significantly reduced, making the mean trend of increasing R a 2 with 
increasing L quite clean. (The combination of observables for which the scatter is minimized is discussed 
in Section 4.) This particular combination defines an effective mass: M a = 2R a 2 /G. In slightly more 
convenient units, this mass is 

(ToTC)=°-«(^)(lSn^) 2 - < 4) 

(Because many of our galaxies are not spherical, some of their support must come from rotation, and so 
ignoring rotation as we are doing is likely to mis-estimate the true mass. See Bender, Burstein & Faber 1992 
for one way to account for this. This quantity will also mis-estimate the mass if some of the support comes 
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Fig. 6. — Residuals of the R a — L relation are anti-correlated with residuals of the a — L relation; galaxies 
of the same luminosity which are smaller than expected have larger velocity dispersions than expected. Plot 
shows the residuals normalized by their rms value. Short-dashed lines show forward and inverse fits to the 
scatter plots, and long-dashed line in between the other two shows A Ro \ M /a R \ M = — A a -M /&v\m 
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Fig. 7. — Same as Figure 4, but for the relation between luminosity L and the combination R a 2 , which is 
supposed to be a measure of mass. 



- 15 - 




Fig. 8. — Same as previous figure, but for the relation between luminosity L and the combination (a/R ) 2 , 
which is supposed to be a measure of density. 
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Fig. 9. — Relation between luminosity L and surface brightness fi in different volume limited catalogs (higher 
redshift catalogs contribute points to the upper-left corners of each plot) . Passive evolution of luminosities 
would shift points upwards and to the right of the zero-redshift relation, but, the slope of the relation should 
remain unchanged. This shift has been subtracted. 
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l)=S.r,7/, 7() ( ^X'^ J~ i:i ' • ( 5 ) 



from anisotropic velocity dispersions.) 

Fiducial values of the effective mass-to-light ratio can be obtained by inserting the maximum likelihood 
values from Table 1 into this relation. This yields M a ps lO 1O 56 /i 7 r o 1 M0 (we used the parameters for the r* 
band, for which R* + 2V* ~ 4.89). The corresponding total absolute magnitude is M* — 5 log 10 /170 ~ —21.15. 
The luminosity of the sun in r* is 4.62 mags, so « 10 10 31 /if 2 L & . The luminosity within the effective 
radius is half this value, so that the effective mass-to-light ratio within the effective radius of an L* object 
is 2/170 x io 10 - 56-10 - 31 rj 3.57/170 times that of the sun. Figure 7 shows that the effective mass-to-light ratio 
depends on luminosity: 

•Mo 
L 

At larger radii, the luminosity can double at most, whereas, if the galaxy is embedded in a dark matter halo, 
the mass at large radii may continue to increase. For this reason one might expect the mass-to-light ratios 
to be significantly larger at larger radii. 

Since the ratio (2R al/G)/(L/2) above is the mass-to- light ratio at the radius which encloses half the 
light, it is tempting to associate it with the mass-to- light ratio at the half mass radius. Because both the 
numerator and the denominator are projected quantities, this is incorrect. For example, if the mass-to-light 
ratio is independent of distance from the galaxy center, then the three dimensional half-mass radius is about 
30% larger than the projected half-light radius (e.g. Hernquist 1990). If the velocity dispersion does not 
change substantially over the range in radii which contribute light, then a fairer estimate of the mass-to-light 
ratio within the half-mass radius would be about 30% larger than the value given above. 

We can define an effective density by setting ZM jAi:R z = (2i? cr 2 /G)/(47ri? 3 /3) = A D p crit , with 
p clit = 3H 2 /8irG, then 

2 / , -1 , x 2 



A = 4( — ) = 4xl0 6 x 0.7 2 ( 'J2°-P^) . ( 6 ) 

\HRoJ VlOOkms- 1 Ro ) K ' 

Figure 8 shows that this effective density decreases with increasing luminosity, although the scatter in 
densities at fixed luminosity is quite large (~ 0.32 dex). Inserting mean values for a and R yields 

A = 5.16 x 10 5 (-j^L.) = 5.16xl0 5 (-^ in r* . (7) 



a h 7 } kpc 



R j R* J \ 

Such a trend is qualitatively similar to that seen in numerical simulations of dissipationless gravitational 
clustering: the central densities of virialized halos in such simulations are smaller in the more massive halos 
(Navarro, Frenk & White 1997). 

Figure 9 shows a final relation at fixed luminosity: the surface-brightness— L relation. In such a plot, 
luminosity evolution moves objects upwards and to the right (larger luminosities and surface brightnesses at 
high redshift), so that the higher redshift population should be obviously displaced from the zero-redshift 
relation. The plot shows the distribution of fi a and M after subtracting the maximum likelihood estimate 
of the evolution from both quantities. The solid line shows the maximum likelihood value of the slope of 
this relation. This differs slightly from the / oc L~ 0A5 scaling Sandage & Perelmuter (1990) find for giant 
galaxies with Mb < —20, although the scatter around the mean relation of ~ 0.58 mags is similar. This 
relation is considerably broader than any of the others we have studied so far, which may account for some 
of the difference. However, a careful inspection of the figure suggests that the relation is becoming shallower 
at high redshift; whether or not this is a signature of differential evolution in the luminosities is the subject 
of work in progress. 
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4. A parametric maximum-likelihood analysis 

Section 2 showed that, after accounting for the fact that the SDSS sample is magnitude-limited, the 
distribution of M = — 2.51og 10 L is quite well described by a Gaussian. In principle, by extending the 
Efstathiou, Ellis & Peterson (1988) method (along the lines described by Sodre & Lahav 1993) we could 
derive non-parametric maximum-likelihood estimates of the three-dimensional distribution of L, R Q and a. 
The virtue of this approach is that it accounts for the fact that the observed sample is magnitude-limited, 
that there is also a cut at small velocity dispersions, and that there are correlated measurement errors 
associated with the luminosities, sizes and velocity dispersions. Once the shape of the three-dimensional 
distribution has been estimated, it is straightforward to obtain estimates of the various correlations with 
luminosity we studied in the previous section. This is the subject of Section 4.2. However, the real benefit 
of the maximum likelihood analysis is that it also yields estimates of correlations between observables which 
do not include luminosity — some examples of these are shown in Section 4.3. 

We chose not to make a non-parametric estimate of the joint distribution because just ten bins in each of 
L, R and a yields 10 3 free parameters to be determined from 10 4 galaxies. Moreover, Section 3 showed that, 
in each of the SDSS wavebands, the distributions of log 10 R a and log 10 o at fixed absolute magnitude are 
quite well described by Gaussian forms. Therefore, the joint distribution of early-type galaxy luminosities, 
sizes, and velocity dispersions should be well described by a tri-variate Gaussian distribution in the variables 
M = — 2.51og 10 L, R = log 10 i? o and V = log 10 a. Saglia et al. (2001) describe a maximum likelihood 
analysis of early-type galaxy correlations in which they assume that a tri-variate Gaussian is a reasonable 
description of their data: we have the luxury of knowing that this is indeed a reasonable description of our 
dataset. Thus, we have a simple parametrization of the joint distribution for which, in each waveband, nine 
numbers suffice to describe the statistical properties of our sample: three mean values, M», i?* and V„, three 
dispersions, a M , a\ and (jy , and three pairwise correlations, (JrUm Prm, av^M Pvm, and (Jrctv Prv ■ 

In addition, we will also allow for the possibility that the luminosities are evolving — a tenth parameter to 
be estimated from the sample. The maximum likelihood technique allows us to estimate these ten numbers 
as follows. We define the likelihood function 



c = n S ( Zt ) - where 

X = (M -M* + Qz,R-R*,V -V,), 



£ = 
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= (^3/2^/2 Bxp(-l^[ C + g ]-l*). 



(8) 



Similarly to when we discussed the luminosity function, S{zi) is defined by integrating over the range of 
absolute magnitudes, velocities and sizes at Zi which make it into the catalog. Here X is the vector of the 
observables, and £ describes the errors in the measurements. 



Appendix D of Paper I describes how the elements of the error matrix £ were obtained. Briefly, the 
error in the absolute magnitude assumes that there are no errors in the redshift or the K-correction, so 
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Table 1: Maximum-likelihood estimates, in the four SDSS bands, of the joint distribution of luminosities, 
sizes and velocity dispersions. The mean values of the variables at redshift z, M* — Qz, i?*, V*, and the 
elements of the covariance matrix C defined by the various pairwise correlations between the variables are 
shown. These coefficients are also used in computing the matrix T in Paper III. 



Band 




M* 


cm 


i?* 




V* 


<?v 


Prm 


PVM 


Prv 


Q 


9* 


5825 


-20.43 


0.844 


0.520 


0.254 


2.197 


0.113 


-0.886 


-0.750 


0.536 


1.15 


r* 


8228 


-21.15 


0.841 


0.490 


0.241 


2.200 


0.111 


-0.882 


-0.774 


0.543 


0.85 


i* 


8022 


-21.49 


0.851 


0.465 


0.241 


2.201 


0.110 


-0.886 


-0.781 


0.542 


0.75 


z* 


7914 


-21.83 


0.845 


0.450 


0.241 


2.200 


0.110 


-0.885 


-0.782 


0.543 


0.60 



all the error comes from the error on the apparent magnitude nidcv! the error on the circularly averaged 
radius R is given by adding the error on the angular length of the longer axis rd cv to those which come 
from the error on the axis ratio b/a. We assume that the errors in b/a are neither correlated with those in 
log 10 rd C v nor with those in the absolute magnitude. However, because both rridev and rd cv come from the 
same fitting procedure, the errors in M and R are correlated. Finally, we assume that errors in magnitudes 
are not correlated with those in velocity dispersion, so e VM is set to zero, and that errors in size and velocity 
dispersion are only weakly correlated because of the aperture correction we apply. 

The covariance matrix C contains six of the ten free parameters we are seeking. It is these parameters, 
along with the three mean values, M*, i?* and V*, and the evolution parameter Q which are varied until the 
likelihood is maximized. The maximum-likelihood estimates of these parameters in each band are given in 
Table 1. Notice that although the luminosity and size distributions differ from band to band, the velocity 
distributions do not. This is reassuring, because the intrinsic distribution of velocity dispersions, estimated 
from the spectra, should not depend on the band in which the photometric measurements were made. As 
an additional test, we also computed maximum-likelihood estimates of the 2x2 covariance matrices of the 
bivariate Gaussians for the pairs (M,R) and (M, V). These estimates of, e.g., prm and pvm were similar 
to those in Table 1. 

The remainder of this paper uses C to estimate various pairwise correlations. In Paper III, we transform 
the covariance matrix C into one which describes the Fundamental Plane variables of size, surface brightness 
and velocity dispersion. 

4.1. The intrinsic distributions of sizes and velocity dispersions 

Before we present maximum likelihood estimates of various correlations, it is worth remarking that 
because the trivariate Gaussian is a good description of the data, our results indicate that, in addition to 
the intrinsic distribution of absolute magnitudes, the intrinsic distributions of (the logarithms of) early-type 
galaxy sizes and velocity dispersions are also well fit by Gaussian forms. The means and dispersions of these 
Gaussians are given by (i?*, ctJj) and (V*, a v ) in Table 1. Note that the width of the distribution of log 10 a 
is about half that of log 10 R . This is consistent with earlier work (e.g., it is one of the motivations for the 
K-space parametrization of Bender, Burstein & Faber 1992). 
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Table 2: Maximum-likelihood estimates of the slopes Svl, Srl, Sml, Sdl, and Sil, of the relations 
between luminosity and the mean velocity dispersion, effective radius, effective mass, effective density, and 
effective surface brightness at fixed luminosity, as a function of luminosity. The slope of the relation between 
surface brightness and the mean size at fixed surface brightness is Sk- 



Band 



~>VL 



~>RL 



~>ML 



~>DL 



■>IL 



Sk 



4.00±0.25 
3.91±0.20 
3.95±0.15 
3.92±0.15 



1.50±0.06 
1.58±0.06 
1.59±0.06 
1.58±0.06 



0.86±0.02 
0.87±0.02 
0.88±0.02 
0.88±0.02 



-1.20 ±0.08 
-1.33 ±0.07 
-1.34 ±0.08 
-1.33 ±0.07 



-2.98 ±0.16 
-3.78 ±0.17 
-3.91 ±0.18 
-3.80 ±0.17 



-0.73 ±0.02 
-0.75 ±0.02 
-0.76 ±0.02 
-0.76 ±0.01 



4.2. Correlations with luminosity 

As we describe below, appropriate combinations of the coefficients in Table 1 provide maximum like- 
lihood estimates of various linear regressions between pairs of observables which are often studied; these 
are summarized in Table 2. Plots comparing some of these linear regressions with the maximum likelihood 
estimates are shown in Section 3. 

In the Gaussian model, the mean of log 10 o at fixed M is 

V - V*\M - M.) = *v PVM S , (9) 

where the second equality defines Svl, for ease of comparison with equation (3). The dispersion around this 
mean is 

°v\M = °v( l - Pvm)- (10) 

Inserting the values in Table 1 into these expressions for Svl and <r v \ M provides the maximum likelihood 
estimate of the slope and thickness of this relation. These are shown in the second column of Table 2, and 
the fit itself is shown in Figure 4. The errors we quote on the slopes of this, and the other relations in the 
Table, were obtained using subsamples as described in Appendix B. Note that the errors we find in this way 
are comparable to those sometimes quoted in the literature, even though each of the subsamples we selected 
is an order of magnitude larger than any sample available in the literature. 

The mean size at fixed absolute luminosity M, and the dispersion around this mean, are obtained by 
replacing all V's with R's in equation (9). The third column in Table 2 gives the maximum likelihood value 
of the slope Srl, of the size-at-fixed-luminosity relation in the four bands. This fit is shown in Figure 5. 

Similarly, one can show that the slopes of the mean L-mass and L-density relations shown in Figures 7 
and 8 are S M l = {2/S V l + 1/Srl)' 1 and S DL = 1/(2/SVl - 2/ Srl)- 1 . These are the fourth and fifth 
columns of Table 2. The dispersions around these mean mass-i and density-!/ relations can be written in 
terms of the elements of C, though we have not included the expressions here. Even though these relations 
are made from linear combinations of R and V, they may be tighter than either the L — a or L — R Q relations 
because the correlation coefficients Prm, Pvm and prv are different from zero. 

The surface brightnesses of the galaxies in our sample are defined by (p Q — p* ) = (M — ) + 5(R — i?* ) , 
so the dispersion in p is <r^ = a M + Wctm&rPrm + 25a R . The mean surface brightness at fixed luminosity 
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is obtained by replacing all Vs with fis in the equations (9) and (10) above. This means that we need /9 m m, 
which we can write in terms of ctm, ctr and prm- The sixth column in Table 2 gives the slope of the surface 
brightness / at fixed luminosity relation, (I \L) oc L x l SlL , in the four bands. These fits are shown in Figure 9. 



4.3. Inverse relations and other correlations 

So far, we have shown that the maximum likelihood analysis provides estimates of correlations which 
are in good agreement with quantities which can also be estimated by a more straightforward regression 
technique. However, with the coefficients of the correlation matrix C in hand, it is straightforward to 
obtain estimates of correlations which, because of selection effects, cannot be reliably estimated using simple 
regressions. For example, the mean luminosity given the velocity dispersion is 

M - M*\V - V*\ = (y ~ K) g m pvm (11) 
/ ay 

with dispersion cr 2 M (l — Py M ) (compare equations 9 and 10). Inserting the coefficients in Table 1 yields 
(L\a) oc cr 2 ' 34 . Similarly, one can show that (L\R a ) oc i?* 23 and (M /L\M ) oc M o - 22 in r* . 

We can also study correlations which do not involve luminosity. The best studied of these is the 
Kormendy (1977) relation: the surface brightnesses of early-type galaxies decrease with increasing effective 
radius. The mean size at fixed surface brightness in our sample is 

R-R* m-M*) = (M ~ ^ <?RP»R = -QAS k (p-p«). (12) 

where p^R can be written in terms of om, pr and Prm, and the final equality defines Sk- The seventh 
column in Table 2 gives the slope of this relation in the four bands. For comparison, Kormendy (1977) found 
that log 10 I Q oc 1.291og 10 R Q in the B-band, and Pahre et al. (1998) find R a oc I^ 0,61 in the K-band. 

For the reasons described in Section 3, when presented with a magnitude limited catalog, correlations 
at fixed luminosity are useful because they are unbiased by the selection. When luminosity is not one of the 
variables then forward and inverse correlations may be equally interesting, and equally biased. For example, 
in the Kormendy (1977) relation, (R — R* p — p*) may be just as interesting as (p — p*\R — i?*} . The slopes 
of the two relations are, of course, simply related to each other. In fact, it may be preferable to study the 
relations which are defined by the principle axes of the ellipse in (i?, p) space which the galaxies populate. 
The directions of these axes are obtained by computing the eigenvalues and vectors of the covariance matrix 
associated with the sizes and surface brightnesses. To illustrate, the eigenvalues of the 2x2 covariance 
matrix associated with the Kormendy relation are 



where we have set D^r = (a R — cr 2 ) 2 + (2aRa ll p ll R) 2 . The +/— eigenvalues give the dispersions along and 
perpendicular to the major axis of the ellipse. The long axis of the ellipse describes the mean relation, 
(R - R*) = Sk (Po - P*), where 



Sk 



2CT R<T^ p^R 

With obvious changes of variables, analogous expressions can be derived for all the correlations presented 
earlier, although we do not show them here. 
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Fig. 10. — Relation between effective radius and surface brightness. Short dashed lines show forward and 
inverse fits to this relation. The zero-points of these fits are strongly affected by the magnitude limit of 
our sample. To illustrate, solid line shows the maximum-likelihood estimate of the relation in the simulated 
complete catalog from which the magnitude limited catalog, shown by the dotted line, was drawn. 
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Table 3: Coefficients a and (3 which define the projection of minimum scatter, (JmrVi m the space defined 
by absolute magnitude, and the logarithms of the size and velocity dispersion. 



Band 


a 


P 


&MRV 


9* 


0.76 


1.94 


0.063 


r* 


0.79 


1.93 


0.058 


i* 


0.82 


1.89 


0.054 


z* 


0.81 


1.90 


0.054 



The Kormendy relation in our sample is shown in Figure 10. The dashed lines show forward and inverse 
fits to the data: i.e., the mean size at fixed surface brightness, and the mean surface brightness at fixed size. 
The parameters of the fits are affected by the magnitude limit of the catalog. To estimate the effect of the 
magnitude limit cut on this relation, we compute the direct and inverse fits to the Kormendy relation in the 
simulated complete and magnitude-limited samples we describe in Appendix A. The dotted line in Figure 10 
shows the direct fit to the magnitude limited simulations (it can hardly be distinguished from the fit to the 
data). 

In comparison, the maximum-likelihood estimate of the true direct relation provides a very good de- 
scription of the relation in the complete simulations in which there is no magnitude limit: it is shown as the 
solid line. Notice that the dashed and dotted lines have approximately the same slope as the solid line: the 
magnitude limit hardly affects the slope, although it changes the zero-point dramatically. At fixed surface 
brightness, the typical R is significantly larger in the magnitude limited sample than in the complete sample. 
This happens because lines of constant luminosity run downwards and to the right with slope —1/5, so that 
changes in luminosity act approximately perpendicular to the relation. 

This shows that although linear regression fits to the data provide good estimates of the true slope of 
the Kormendy relation, they provide bad estimates of the true zero-point. In comparison, the maximum- 
likelihood technique, which accounts for the selection on apparent magnitudes, is able to estimate the slope 
and the zero-point correctly. 

Another interesting correlation is that between the effective mass and density defined in equations (4) 
and (6). A little algebra shows that (A |M ) oc M~ 52 . Figure 11 shows forward and inverse fits to this 
relation. The characteristic density A c of halos seen in numerical simulations of hierarchical clustering scales 
with halo mass: (A c |m) <~ 9 3 (m/m*) -0 ' 4 (Bullock et al. 2001), which is qualitatively similar to the scaling 
of effectve density with effective mass in our sample. The scatter in characteristic densities at fixed halo 
mass, ~ 0.33 dex, is also rather similar to the scatter in effective densities at fixed mass. These coincidences 
may provide important clues to how early-type galaxies formed. 

In contrast to the Faber- Jackson, radius-luminosity, Kormendy, and mass-density relations, the re- 
lations between luminosity and mass and luminosity and density involve three variables. Is there some 
combination of these variables which provides the least scatter? The eigenvectors and eigenvalues of the 
3x3 matrix C give the directions of the principle axes of the ellipsoid in (M, R, V) space which the early- 
type galaxies populate. One of the eigenvalues of C is considerably smaller than the others, suggesting that 
the galaxies populate a two-dimensional plane in (M, R, V) space. The eigenvectors show that the plane is 
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Fig. 11. — Relation between effective density and effectve mass. Short dashed lines show forward and inverse 
fits to this relation. The more massive galaxies are less dense. Text in top right of each panel shows the 
slopes of the dashed lines. 
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viewed edge-on in the projection 

-0.4(M - M») = a (fl - + /? (V - V*), (13) 

where M*, i?* and V* were given in Table 1, and the coefficients a and (3, and the thickness of the plane in 
this projection, ctmrv are given in Table 3. Section 3 shows that a scatter plot of luminosity versus mass 
is considerably tighter than plots of M versus log 10 R a or log 10 a. The eigenvectors of C show that this is 
because the M versus R + 2V projection is actually quite close to the edge-on projection. It is interesting 
that this plane is only about 10% thicker than the Fundamental Plane relation between R a , I Q and a which 
is the subject of Paper III. 

5. Discussion and conclusions 

We have studied the properties of ~ 9000 early- type galaxies over the redshift range < z < 0.3 using 
photometric (the g* , r* , i* and z* bands) and spectroscopic observations. The intrinsic distributions of 
luminosity, velocity dispersion and half-light radius of the galaxies in our sample are each well described by 
Gaussians in absolute magnitude, log 10 <r, and log 10 -R o - 

A maximum likelihood analysis of the joint distribution of luminosities, sizes and velocity dispersions 
suggests that the population at higher redshifts is slightly brighter than the population nearby, and that the 
change with redshift is faster in the shorter wavebands: If M*(z) = M*(0) — Qz, then Q = 1.15, 0.85 and 
0.75 in g* , r* and i*. This evolution is sufficiently weak that, relative to their values at the median redshift 
(z <~ 0.15) of our sample, the sizes, surface brightnesses and velocity dispersions of the early-type galaxy 
population at lower and higher redshifts has evolved little. The fact that we see so little evolution in the 
luminosities argues for a relatively high formation redshift: Bruzual & Chariot (2003) single burst stellar 
population synthesis models indicate that iform ~ 9 Gyrs. This is consistent with the model we use to make 
K-corrections in Paper I, and is also consistent with the formation time estimates based on the Fundamental 
Plane in Paper III, and galaxy colors and spectral line indices in Paper IV. 

We find that (cr\L) oc L 1 / 4 and (R \L) cx L 3 / 5 (see Table 2 for the exact coefficients, and Figures 4 and 5 
for the fits). Galaxies which are slightly larger than expected (given their luminosity) have smaller velocity 
dispersions than expected (Figure 6). This is expected if galaxies are in virial equilibrium. 

A plot of luminosity versus effective mass M = 2R a 2 /G is substantially tighter than either the L — a 
or the L — R Q relations. It has a slope which is slightly shallower than unity. In particular, on scales of a few 
kiloparsecs, L oc M°- 86 , approximately independent of waveband (Figure 7). This complements recent SDSS 
weak-lensing analyses (McKay et al. 2001) which suggest that mass is linearly proportional to luminosity in 
these same wavebands, but on scales which are two orders of magnitude larger (~ 260ft.~ 1 kpc). Together, 
these two measurements of the mass-to-light ratio can be used to provide a constraint on the density profiles 
of dark matter halos. 

A plot of luminosity versus effective density A Q oc a 2 /R 2 shows that (A |L) oc L~ 3 / 4 (Figure 8). 
Moreover, a maximum likelihood analysis suggests that the more massive galaxies are less dense: (A |M ) oc 
M~ ' 52 (Figure 11). This is qualitatively similar to a trend seen in numerical simulations of hierarchical 
clustering: more massive halos tend to be less centrally concentrated (Navarro, Frenk & White 1997). This 
coincidence may provide an important clue to how early-type galaxies formed. 

The Kormendy relation between size and surface brightness has approximately the same slope (R \I ) oc 
Ig ' 77 in all four SDSS bands (Figure 10). Our maximum likelihood analysis, and measurements made in 
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mock catalogs which reproduce all the observed scalings of the dataset (a procedure for generating such 
catalogs is described in Appendix A), show that the zero-point of this relation is strongly affected by the 
magnitude limit of the sample (Section 4.3). 
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This Appendix describes how to use our knowledge of the covariance matrix C to simulate mock galaxy 
samples which have the same correlated observables as the data. We use these mock samples to estimate 
the effect of the magnitude limit cut on the relations we wanted to measure in the main text. 

The observed parameters L, R a and a of each galaxy in our sample are drawn from a distribution, say, 
(j>(M, R, V\z), where M is the absolute magnitude, R = log 10 R Q and V — log 10 a. We show in Section 3 
that 4>{M, R, V\z) = p(R,V\M, z) (f>(M\z), where (f>(M\z) is the luminosity function at redshift z, and the 
distribution of R and V at fixed luminosity is, to a good approximation, a bivariate Gaussian. The maximum 
likelihood estimates of the parameters of the luminosity function and of the bivariate distribution at fixed 
luminosity can be obtained from Table 1. 

To make the simulations we must assume that, when extrapolated down to luminosities which we do 
not observe, these relations remain accurate. Assuming this is the case, we draw M from the Gaussian 
distribution that we found was a good fit to <fi(M\z) (Section 2). We then draw R from the Gaussian 
distribution with mean (R\M) and dispersion <J^ M - Finally, we draw V from a Gaussian distribution with 
mean and variance which accounts for the correlations with both M and R. In practice we draw three zero 
mean unit variance Gaussian random numbers: go, <7i, an d 32, and then set 



A. Simulating a complete sample 



M = M*+a M go, 



R 




and 



This preprint was prepared with the AAS IAT15X macros v5.0. 
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a V\RM 




where 



Because each simulated galaxy is assigned a luminosity and size, its surface brightness is also fixed: fi = 
M + 5R+ constant. 

If we generate a catalog in r* , then we can also generate colors using the parameters given in Tabic 1 of 
Paper IV. Specifically, generate a Gaussian variate 53, and then set C = C* +£cm(M — M*)/(tm + £cv(V — 
V*)/av + g3&c\MV, where £cm, £cv and a c \uv are defined analogously to £mv, Crv and ay\ RM above . 
Inserting the values from Table 1 of Paper IV shows that £cm ~ 0, and o c \mv ~ a c\v = cr c\/l — Pvm'- 
the mean color is determined by the velocity dispersion and not by the absolute magnitude. 

Passive evolution of the luminosities and colors is incorporated by adding the required z dependent shift 
to M and C after the sizes and velocity dispersions have been generated. 

This complete catalog can be used to simulate a magnitude limited catalog if we assign each mock 
galaxy a redshift, assuming a world model and homogeneity. Let m m i n and m max denote the apparent 
magnitude limits of the observed sample. Let -/^Bright denote the absolute magnitude of the most luminous 
galaxy we expect to see in our catalog. Because the luminosity function cuts off exponentially at the bright 
end, we can estimate this by setting Meright ~ M* + 5<7m- This means that the most distant object 
which can conceivably make it into the magnitude limited catalog lies at a luminosity distance of about 
4max = 1 0(" lmax ~ MBri s ht ~ 25 )/ 5 , from which the maximum redshift z max can be determined. If the comoving 

number density of mock galaxies is to be independent of redshift, we must assign redshifts as follows. Draw 

1/3 

a random variate u\ distributed uniformly between zero and one, and set doom = U\ 4max/ (1 + z max ) . The 
redshift z can be obtained by inverting the dc O m{z;0,,A) relation. The apparent magnitude of this mock 
galaxy is m — M + 5Logi dL + 25 + K(z), where K(z) is the K-correction. If m m i n < m < m max , then this 
galaxy would have been observed; add it to the subset of galaxies from the complete catalog which would 
have been observed in the magnitude limited catalog. 

If our simulated catalogs are accurate, then plots of magnitude, size, surface-brightness and velocity 
dispersion versus redshift made using our magnitude-limited subset should look very similar to the SDSS 
dataset shown in Figure 12 of Paper I. In addition, dN/dz in the simulated magnitude limited subset should 
be similar to that in Figure 3. Furthermore, any correlations between observables in the magnitude limited 
subset should be just like those in the actual SDSS dataset. If they are, then one has good reason to assume 
that similar correlations measured in the complete, rather than the magnitude-limited simulation, represent 
the true correlations between the parameters of SDSS galaxies, corrected for selection effects. In this way, 
the simulations allow one to estimate the impact that the magnitude-limited selection has when estimating 
correlations between early-type galaxy observables. 

We have verified that our simulated magnitude limited catalogs have similar dN/dz distributions to 
those observed, and the simulated o and R versus z plots show the same selection cuts at low velocities 
and sizes as do the observed data. The distribution of apparent magnitudes, angular sizes, and velocity 



- 29 - 



dispersions in the magnitude limited simulations are very similar to those in the real data. The simulated 
parameters also show the same correlations at fixed luminosity as the data. Maximum likelihood analysis on 
the simulations produces an estimate of the covariance matrix which is similar to that of the data. Therefore, 
we are confident that our simulated complete catalogs have correlations between luminosity, size, and velocity 
dispersion which are similar to the data. 

B. Composite volume-limited catalogs 

Our parent sample is magnitude limited; unless accounted for, this will introduce a bias into a number 
of correlations we study in this series of papers. For this reason, we often present results measured in a few 
volume limited subsamples. Because of the cuts at both the faint and the bright ends of the catalog, each 
volume-limited subsample used in the main text spans only a small range in luminosity. However, because 
the galaxies in our sample luminosity show little or no evolution relative to the values at the median redshift 
of the sample, we can extend this range in either of three ways. 

One method is to construct a composite volume-limited catalog by stacking together smaller volume- 
limited subsamples which are adjacent in redshift and in luminosity, but which do not overlap at all. Let 
Vi denote the volume of the ith subsample, and let Ni denote the number of galaxies in it. A conservative 
approach is to randomly choose the galaxies in Vi with probability proportional to mm(Vi)/Vi, where min(Vi) 
denotes the volume of the smallest of the subsamples. This has the disadvantage of removing much of the 
data, but, because our data set is so large, we can afford this luxury. A more cavalier approach is to choose all 
the galaxies in the largest Vj, all the galaxies in the other Vj, and to generate a set of additional galaxies by 
randomly choosing one of the Nj galaxies in Vj , adding to each of its observed parameters a Gaussian random 
variate with dispersion given by the quoted observational error, and repeating this Nj x [max(Vj)/Vj — 1] 
times. A final possibility is to weight all the galaxies in V, (even those which were not in the volume limited 
subsample) by the inverse of the volume in which they could have been observed (V max — V m i n ). We chose 
the first, most conservative option. 

By piecing together three volume limited subsamples, we were able to construct composite catalogs of 
about 10 3 objects each. Because the completeness limits are different in the different bands, the composite 
catalogs are different for each band. In addition, because any one composite catalog is got by subsampling 
the set of eligible galaxies, by subsampling many times, we can generate many realizations of a composite 
catalog. This allows us to estimate the effects of sample variance on the various correlations we measure. 



